Fuzzy Jaccard Index: A robust comparison of ordered lists

نویسندگان

چکیده

We propose Fuzzy Jaccard Index (Fuji) — a scale-invariant score for similarity assessment of two ranked/ordered lists. Fuji improves upon the index by incorporating membership function that takes into account particular ranks, thus producing both more stable and accurate estimates. provide theoretical insights properties as well an efficient algorithm computing it. also present empirical evidence its performance in different synthetic scenarios. Finally, we demonstrate utility typical machine learning setting comparing feature ranking lists, relevant to given task. In many practical applications, originating from high-dimensional domains, where only small percentage whole space might be relevant, robust confident leads interpretable findings, computation good predictive performance. such cases, correctly distinguishes between existing approaches, while being than benchmark scores.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

a comparison of linguistic and pragmatic knowledge: a case of iranian learners of english

در این تحقیق دانش زبانشناسی و کاربردشناسی زبان آموزان ایرانی در سطح بالای متوسط مقایسه شد. 50 دانش آموز با سابقه آموزشی مشابه از شش آموزشگاه زبان مختلف در دو آزمون دانش زبانشناسی و آزمون دانش گفتار شناسی زبان انگلیسی شرکت کردند که سوالات هر دو تست توسط محقق تهیه شده بود. همچنین در این تحقیق کارایی کتابهای آموزشی زبان در فراهم آوردن درون داد کافی برای زبان آموزان ایرانی به عنوان هدف جانبی تحقیق ...

15 صفحه اول

Fuzzy Jaccard with Degree of Optimism Ranking Index Based on Function Principle Approach

Jaccard index similarity measure which applies the extension principle approach to obtain fuzzy maximum and fuzzy minimum has been proposed in ranking fuzzy numbers. However, the extension principle used is only applicable to normal fuzzy numbers and, therefore, fails to rank non-normal ones. Apart from that, the extension principle does not preserve the type of membership function of fuzzy num...

متن کامل

HyperMinHash: Jaccard index sketching in LogLog space

In this extended abstract, we describe and analyse a streaming probabilistic sketch, HYPERMINHASH, to estimate the Jaccard index (or Jaccard similarity coefficient) over two sets A and B. HyperMinHash can be thought of as a compression of standard logn-space MinHash by building off of a HyperLogLog count-distinct sketch. For a multiplicative approximation error 1+ on a Jaccard index t, given a ...

متن کامل

New Similarity Measures Between Generalized Trapezoidal Fuzzy Numbers Using the Jaccard Index

Similarity measures between generalized trapezoidal fuzzy numbers (GTFNs) are employed to indicate the degrees of similarity between GTFNs. Although several similarity measures of GTFNs have been proposed in the literature, none has considered using the Jaccard index. In general, the Jaccard index is a statistic used for comparing the similarity and diversity of sample sets. This paper presents...

متن کامل

Estimating Jaccard Index with Missing Observations: A Matrix Calibration Approach

The Jaccard index is a standard statistics for comparing the pairwise similarity between data samples. This paper investigates the problem of estimating a Jaccard index matrix when there are missing observations in data samples. Starting from a Jaccard index matrix approximated from the incomplete data, our method calibrates the matrix to meet the requirement of positive semi-definiteness and o...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Applied Soft Computing

سال: 2021

ISSN: ['1568-4946', '1872-9681']

DOI: https://doi.org/10.1016/j.asoc.2021.107849